The role of feature selection in building pattern recognizers for computer aided diagnosis

نویسندگان

  • Clay Spence
  • Paul Sajda
چکیده

In this paper we explore the use of feature selection techniques to improve the generalization performance of pattern recognizers for computer aided diagnosis CAD We apply a modi ed version of the sequential forward oating selection SFFS of Pudil et al to the problem of selecting an optimal feature subset for mass detection in digitized mammograms The complete feature set consists of multi scale tangential and radial gradients in the mammogram region of interest We train a simple multi layer perceptron MLP using the SFFS algorithm and compare its performance using a jackknife procedure to an MLP trained on the complete feature set features Results indicate that a variable number of features is chosen in each of the jackknife sets and the test performance Az using the chosen feature subset is no better than the performance using the entire feature set These results may be attributed to the fact that the feature set is noisy and the data set used for training testing is small We next modify the feature selection technique by using the results of the jackknife to compute the frequency at which di erent features are selected We construct a classi er by choosing the top N features selected most frequently which maximize performance on the training data We nd that by adding this hand tuning component to the feature selection process we can reduce the feature set from to features and at the same time have a statistically signi cant increase in generalization performance p

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

سیستم کمک تشخیصی کامپیوتری جهت آشکارسازی بیماری انسدادی ریه در تصاویر سی‌تی‌اسکن

Background: Chronic Obstructive Pulmonary Disease (COPD) is one of the most prevalent pulmonary diseases. Use of an automatic system for the detection and diagnosis of the disease will be beneficial to the patients' treatment decision-making process. In this paper, we propose a new approach for the Computer Aided Diagnosis (CAD) of the disease and determination of its severity axial CT scan ima...

متن کامل

A Novel Fuzzy-C Means Image Segmentation Model for MRI Brain Tumor Diagnosis

Accurate segmentation of brain tumor plays a key role in the diagnosis of brain tumor. Preset and precise diagnosis of Magnetic Resonance Imaging (MRI) brain tumor is enormously significant for medical analysis. During the last years many methods have been proposed. In this research, a novel fuzzy approach has been proposed to classify a given MRI brain image as normal or cancer label and the i...

متن کامل

Intelligent application for Heart disease detection using Hybrid Optimization algorithm

Prediction of heart disease is very important because it is one of the causes of death around the world. Moreover, heart disease prediction in the early stage plays a main role in the treatment and recovery disease and reduces costs of diagnosis disease and side effects it. Machine learning algorithms are able to identify an effective pattern for diagnosis and treatment of the disease and ident...

متن کامل

A Random Forest Classifier based on Genetic Algorithm for Cardiovascular Diseases Diagnosis (RESEARCH NOTE)

Machine learning-based classification techniques provide support for the decision making process in the field of healthcare, especially in disease diagnosis, prognosis and screening. Healthcare datasets are voluminous in nature and their high dimensionality problem comprises in terms of slower learning rate and higher computational cost. Feature selection is expected to deal with the high dimen...

متن کامل

Diagnosis of Breast Cancer Subtypes using the Selection of Effective Genes from Microarray Data

Introduction: Early diagnosis of breast cancer and the identification of effective genes are important issues in the treatment and survival of the patients. Gene expression data obtained using DNA microarray in combination with machine learning algorithms can provide new and intelligent methods for diagnosis of breast cancer. Methods: Data on the expression of 9216 genes from 84 patients across...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004